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Abstract. We propose a new theoretical model for passively mobile Wireless Sensor Networks, called 
PM (standing for Passively mobile Machines). The main modification w.r.t. the Population Protocol 
model [2] is that agents now, instead of being automata, are Turing Machines. We provide general defi- 
nitions for unbounded memories, but we are mainly interested in computations upper-bounded by log n, 
with n being the size of the population (a space bound that clearly keeps our devices tiny). However, our 
results easily generalize to any space bound which is at least logn. We focus on complete communication 
graphs and define the complexity class PM SPACE(f(n)) consisting of all predicates on input assign- 
ments that are stably computable by some PM protocol that uses 0(f(n)) memory. We assume that the 
agents are initially identical and have no global knowledge of the system, which together with the com- 
pleteness of the communication graph implies that all computable predicates have to be symmetric. We 
show that there exists a PM protocol that can assign unique consecutive ids to the agents and inform 
them of the population size, only by using C(log n) memory. This allows us to give a direct simulation of 
any Deterministic Turing Machine of O(nlogn) space by PM protocols using O(logn) space. Going one 
step further, we generalize the simulation of the deterministic TM to prove that the PM model can sim- 
ulate any Nondeterministic TM of 0(n log n) space under the same space bounds. Moreover, by showing 
that a Nondeterministic TM of O(nlogn) space decides any language in PMSPACE(logn), we end 
up with an exact characterization for P M S P AC E(\ogn): it is precisely the class of all symmetric pred- 
icates in NSPACE(n\ogn). All these results easily generalize to the following exact characterizations 
for the classes P M S P AC E{f (n)) , for all f(n) = 0(\og n): they are precisely the classes of all symmetric 
predicates in NSPACE(nf(n)). In this way, we provide a space hierarchy for the PM model when the 
memory bounds are i7(log n). Finally, we explore the computability of the PM model when the protocols 
use o(loglogn) space and prove that SEMI LIN EAR = PMSPACE(f(n)) when f(n) = o(loglogn), 
where SEMILINEAR denotes the class of the semilinear predicates. In fact, we prove that this bound 
is tight, so that SEMILINEAR C PMSPACE(f(n)) when /(n) = O(loglogn). 

1 Introduction 

Theoretical models for Wireless Sensor Networks have received great attention over the past few years. 
Recently, Angluin et al. [2] proposed the Population Protocol (PP) model. Their aim was to model sensor 
networks consisting of tiny computational devices with sensing capabilities that follow some unpredictable 
and uncontrollable mobility pattern. Due to the minimalistic nature of their model, the class of computable 
predicates was proven to be fairly small: it is the class of semilinear predicates, thus, not including multi- 
plication of variables, exponentiations, and many other important operations on input variables. Moreover, 
Delporte-Gallet et al. [16] showed that PPs can tolerate only 0(1) crash failures and not even a single 
Byzantine agent. 

The work of Angluin et al. shed light and opened the way towards a brand new and very promising 
direction. The lack of control over the interaction pattern, as well as its inherent nondctcrminism, gave 
rise to a variety of new theoretical models for WSNs. Those models draw most of their beauty precisely 
from their inability to organize interactions in a convenient and predetermined way. In fact, the Population 
Protocol model was the minimalistic starting-point of this area of research. Most efforts are now towards 
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strengthening the model of Angluin et al. with extra realistic and implementable assumptions, in order to 
gain more computational power and/or speed-up the time to convergence and/or improve fault-tolerance [?], 
[18]- 

In this work, we think of each agent as being a Turing Machine. In particular, we propose a new theoretical 
model for passively mobile sensor networks, called the PM model. It is a model of Passively mobile Machines 
(that we keep calling agents) with sensing capabilities, equipped with two-way communication. We focus 
on PM protocols that use O(logn) memory since, having logarithmic communicating machines seems to be 
more natural than communicating automata of constant memory. First of all, the communicating machines 
assumption is perfectly consistent with current technology (cellphones, iPods, PDAs, and so on). Moreover, 
logarithmic is, in fact, extremely small. In addition, we explore the computability of the PM model on 
different space bounds in order to get an insight of the trade-off between computational power and resource 
(memory) availability. As will shall see, in PM protocols that use f(n) = J?(logn) space, agents can be 
organized into a distributed nondcterministic TM that makes use of all the available space. In the case, 
where f(n) = o(loglogn) however, we show that the PM protocols are computationally equal to Population 
Protocols. Thus, we provide exact characterizations for the input symmetric computations performed by 
communicating TMs using the above space bounds. 

1.1 Other Previous Work 

In [2], the Probabilistic Population Protocol model was proposed, in which the scheduler selects randomly and 
uniformly the next pair to interact. Some recent work has concentrated on performance, supported by this 
random scheduling assumption (see e.g. [4]). [10] proposed a generic definition of probabilistic schedulers 
and a collection of new fair schedulers, and revealed the need for the protocols to adapt when natural 
modifications of the mobility pattern occur. [9, 15] considered a huge population hypothesis (population 
going to infinity), and studied the dynamics, stability and computational power of probabilistic population 
protocols by exploiting the tools of continuous nonlinear dynamics. 

In addition, several extensions of the basic model have been proposed in order to more accurately reflect 
the requirements of practical systems. The Mediated Population Protocol (MPP) model of [13] was based on 
the assumption that each edge of the communication graph can store a state. It has been recently proven 
[11] that in the case of complete graphs the corresponding class is the symmetric subclass of NSPACE(n 2 ), 
rendering the MPP model extremely powerful. Gucrraoui and Ruppcrt [18] made another natural assumption: 
each agent has its own unique id and can store up to a constant number of other agents' ids. In this 
model, which they named Community Protocol model, the only permitted operation on ids is comparison. 
It was proven that the corresponding class consists of all symmetric predicates in N SPACE '(nlogn). In 
[1], Angluin et al. studied what properties of restricted communication graphs are stably computable, gave 
protocols for some of them, and proposed an extension of the model with stabilizing inputs. In [12], MPP's 
ability to decide graph properties was studied and it was proven that connectivity is undccidablc. Some 
other works incorporated agent failures [16] and gave to some agents slightly increased computational power 
[7] (heterogeneous systems). Recently, Bournez et al. [8] investigated the possibility of studying population 
protocols via game-theoretic approaches. For an excellent introduction to the subject of population protocols 
see [6] and for some recent advances mainly concerning mediated population protocols see [14]. 

2 Our Results - Roadmap 

In Section 3, we begin with a formal definition of the PM model. The section proceeds with a thorough 
description of the systems' functionality and then provides definitions of configurations and fair executions. 
In Section 4, first stable computation and then the complexity classes SSPACE(f(n)), S N S P AC E(f (n)) 
(symmetric predicates in SPACE(f(n)) and NSPACE(f( n)), respectively), and PMSPACE(f(n)) (sta- 
bly computable predicates by the PM model using 0(f{n)) space) are defined, while in section 5 we give 
two examples of PM protocols. In Section 6, we prove that PM protocols can assume the existence of unique 
consecutive ids and knowledge of the population size at the space cost of O(logn) (Theorem 2). In section 7 
we show how to exploit this knowledge in order for the system to simulate a non-deterministic TM (Theorem 
5). This, along with Theorem 3 provide an exact characterization of the PLM: PLM — SNSPACE(nf(n)) 
(Theorem 6). Based on the results of this section, we establish a space hierarchy theorem for the PM model, 
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when the corresponding protocols use O(logn) space (Theorem 12). In section 9 we examine the interesting 
case of the o(\oglogn) space bounded protocols, showing that it acts as a computability threshold of some 
sort. Finally, in Section 10 we conclude and discuss some future research directions. 

3 The Model 

In this section, we formally define the PM model and describe its functionality. In what follows, we denote 
by G = (V, E) the (directed) communication graph: V is the set of agents, or population, and E is the set of 
permissible ordered pairwise interactions between these agents. We provide definitions for general communi- 
cation graphs and unbounded memories, although in this work we deal with complete communication graphs 
only and are mainly interested in computations that are space-bounded by a logarithm of the population 
size. We generally denote by n the population size (i.e. n = \V\). 

Definition 1. A PM protocol is a 6-tuple (X, r, Q, S, 7, q ) where X, r and Q are all finite sets and 

1. X is the input alphabet, where U ^ X, 

2. r is the tape alphabet, where U E T and Icf, 

3. Q is the set of states, 

4- S : Q x r 4 — > Q x r 4 x {L, R} 4 x {0, 1} is the internal transition function, 

5. 7 : QxQ^QxQis the external transition function ( or interaction transition function ), and 

6. qo € Q is the initial state. 

Each agent is equipped with the following: 

— A sensor in order to sense its environment and receive a piece of the input. 

— Four read/ write tapes: the working tape, the output tape, the incoming message tape and the outgoing 
message tape. We assume that all tapes are bounded to the left and unbounded to the right. 

— A control unit that contains the state of the agent and applies the transition functions. 

— Four heads (one for each tape) that read from and write to the cells of the corresponding tapes and can 
move one step at a time, either to the left or to the right. 

— A binary working flag cither set to 1 meaning that the agent is working internally or to meaning that 
the agent is ready for interaction. 

Initially, all agents are in state qo, their working flag is set to 1, and all their cells contain the blank symbol 
U. We assume that all agents concurrently receive their sensed input (different agents may sense different 
data) as a response to a global start signal. The input to each agent is a symbol from X and is written on 
the leftmost cell of its working tape. 

When its working flag is set to 1 we can think of an agent working as a usual multitape Turing Machine 
(but it additionally writes the working flag). In particular, while the working flag is set to 1 the internal 
transition function 5 is applied, the control unit reads the symbols under the heads and its own state, updates 
all of them, moves each head one step to the left or to the right, and sets the working flag to or 1, according 
to S. 

As it is common in the population protocol literature, a fair adversary scheduler selects ordered pairs 
of agents (edges from E) to interact. Assume now that two agents u and v are about to interact with u 
being the initiator of the interaction and v being the responder. Let / : V — > {0, 1} be a function returning 
the current value of each agent's working flag. If at least one of /(it) and f(v) is equal to 1, then nothing 
happens, because at least one agent is still working internally. Otherwise (/(it) = f(v) = 0), both agents 
are ready and an interaction is established. In the latter case, the external transition function 7 is applied, 
the states of the agents are updated accordingly, the outgoing message of the initiator is copied to the 
leftmost cells of the incoming message tape of the responder (replacing its contents and writting U to all 
other previously non-blank cells) and vice versa (we call this the message swap), and finally the working 
flags of both agents are again set to 1. These operations could be handled by the protocols themselves, but 
then protocol descriptions would become awkward. So, we simply think of them as automatic operations 
performed by the hardware. These operations are also considered as atomic, that is, the interacting agents 
cannot take part in another interaction before the completion of these operations and, moreover, either all 
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operations totally succeed or are all totally aborted (in which case, the states of the interacting agents are 
restored) . 

Note that the assumption that the internal transition function S is only applied when the working flag 
is set to 1 is weak. In fact, an equivalent way to model this is to assume that S is of the form 5 : Q x 
r A x {0, 1} -> Q x r A x {L, R, S*} 4 x {0, 1}, that it is always applied, and that for all q £ Q and a £ -T 4 , 
S(q, a, 0) = (q, a, S* 4 , 0) is satisfied, where S means that the corresponding head "stays put" . The same holds 
for the assumptions that 7 is not applied if at least one of the interacting agents is working internally and 
that the working flags are set to 1 when some established interaction comes to an end; it is equivalent to 
an extended 7 of the form 7 : Q 2 x {0, l} 2 — > Q 2 x {0, l} 2 , that is applied in every interaction, and for 
which 7(<7i,g 2 ,/i,/2) = (91, 92, /1, .ft) if h = 1 or ft = 1, and 7(91, q 2 , /1, ft) = (71(91, 92), 72(91, 92), 1, 1) 
if /1 = ft =0, hold for all 91,92 £ Q, and we could also further extend S and 7 to handle the exchange of 
messages, but for sake of simplicity we have decided to leave such details out of the model. 

Since each agent is a TM, we use the notion of a configuration to capture its "state" . An agent configura- 
tion is a tuple (q,l w ,r w ,l ,r ,li m ,r im ,l om ,r om , /), where q£ Q, k,n £ T*, and / £ {0,1}. q is the state of 
the control unit, l w (l ,hm,lom) is the string of the working (output, incoming message, outgoing message) 
tape to the left of the head (including the symbol scanned), r w (r ,ri m ,r m) is the string of the working 
(output, incoming message, outgoing message) tape to the right of the head (excluding infinite sequences of 
blank cells), and / is the working flag indicating whether the agent is ready to interact (/ = 0) or carry- 
ing out some internal computation (/ = 1). Let B be the set of all agent configurations. Given two agent 
configurations A, A' £ B, we say that A yields A' if A' follows A by a single application of 6. 

A population configuration is a mapping C : V — > B, specifying the agent configuration of each agent in 
the population. Let C, C be population configurations and let u £ V. We say that C yields C via agent 
transition u, denoted C A C, if C(u) yields C'(u) and C'(w) = C(w), Vw £V — {u}. 

Denote by q(A) the state component of an agent configuration A and similarly for the other components 
(e.g. l w (A), ri m (A), f(A), and so on). Let s tp (A) — l tp (A)rt p (A) , that is, we obtain by concatenation the 
whole contents of tape tp £ {w, o, im, om}. Given a string s and 1 < i,j < \s\ denote by s[. . .i] its prefix 
sis 2 ■ ■ ■ Si and by s[j . . .] its suffix SjSj + i ■ ■ ■ «u. If i, j > \s\ then s[. . .i] = sl_P~l s l (i.e. i — \s\ blank sym- 
bols appended to s) and s[j . . .] = e. For any external transition 7(91,92) = (91,92) define 71(91,92) = q[ 
and 72(91,92) = 92- Given two population configurations C and C, we say that C yields C via encounter 
e = (u, v) e E, denoted C 4 C, if one of the following two cases holds: 

Case 1 (only for this case, we define C u = C(u) to avoid excessive number of parentheses): 

— f{C(u)) = f(C{v)) = 0, which guarantees that both agents u and v are ready for interaction under the 
population configuration C. 

- C'(u) = (71 (9(C), 9(C)), l w {C u ), r w (C u ), l (C u ), r a (C u ), s om (C v )[. . . \l im (C u )\], 

Som{C V )[\km{C U )\ + 1 • . .],L(C"), r om (C u ), 1), 

- C'( W ) = (72(9(C),9(C)),/ w (C-),r w (^),/ (^),r (^), Som (C«)[...|U(C)|], 

s om (C u )[\k m (C v )\ + l...}, l om (C v ),r om (C v ),l), and 

- C'(w) = C(w), Vw £ V - {u,v}. 

Case 2: 

— f{C(u)) = 1 or f(C(v)) — 1, which means that at least one agent between u and v is working internally 
under the population configuration C, and 

— C'(w) = C(w), Vw £ V. In this case no effective interaction takes place, thus the population configuration 
remains the same. 

Generally, we say that C yields (or can go in one step to) C, and write C — > C , if C A C for some 
e £ E (via encounter) or C A C for some u £V (via agent transition), or both. We say that C is reachable 
from C, and write C A C if there is a sequence of population configurations C — Co, C\, . . . , Ct — C such 
that Ci — > Cj+i holds for all « £ {0, 1, . . . , t — 1}. An execution is a finite or infinite sequence of population 
configurations Co, C\ . . . , so that Ci — > Cj+i. An infinite execution is /air if for all population configurations 
C, C such that C — > C, if C appears infinitely often then so does C . A computation is an infinite fair 
execution. 
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We assume that the input alphabet X, the tape alphabet r, and the set of states Q are all sets whose 
cardinality is fixed and independent of the population size (i.e. all of them are of cardinality 0(1)). Thus, 
protocol descriptions have also no dependence on the population size and the PM model preserves uniformity. 
Moreover, PM protocols are anonymous, since there is no room in the state of the agents for unique identifiers 
(though here there is plenty of room on the tapes to create such ids). Uniformity and anonymity are two 
outstanding properties of population protocols [2]. 

4 Stably Computable Predicates 

The predicates that we consider here are of the following form. The input (also called an input assignment) to 
the population is any string x — o\02 ■ ■ ■ o~ n G X* , with n being the size of the population. In particular, by 
assuming an ordering over V, the input to agent i is the symbol a i7 1 < i < n. 3 Any mappings : X* — > {0, 1} 
is a predicate on input assignments. 

Definition 2. A predicate on input assignments p is called symmetric if for every x e X* and any x' which 
is a permutation of x 's symbols, it holds that p(x) = p(x') 

In words, permuting the input symbols does not affect the symmetric predicate's outcome. From each 
predicate p a language L p is derived that is the set of all strings that make p true or equivalently, L p = {x G 
X* | p(x) = 1}. If a predicate p is symmetric, L p is a symmetric language, that is, for each input string 
x e L p any permutation of x's symbols x' also belongs in L p . 

A population configuration C is called output stable if for every configuration C that is reachable from 
C it holds that 0{C (u)) — 0(C(uj) for all u e V, where 0(C(uj) = s (C(u)). In simple words, no agent 
changes its output in any subsequent step and no matter how the computation proceeds. A predicate on 
input assignments p is said to be stably computable by a PM protocol A in a graph family U if, for any input 
assignment x € X*, any computation of A, on any communication graph from U of order |x|, contains an 
output stable configuration in which all agents have p{x) written on their output tape. In what follows, we 
always assume that the graph family under consideration contains only complete communication graphs. 

We say that a predicate p over X* belongs to SPACE(f(n)) (NSPACE(f(n))) if there exists some 
deterministic (nondeterministic, resp.) TM that decides L p using 0(f(n)) space. 

Definition 3. Let SSPACE(f(n)) and SNSPACE(f(n)) be SPACE{f{n))'s and N SPACE{f{n))'s re- 
strictions to symmetric predicates, respectively. 

Definition 4. Let P M S P AC E(f (n)) be the class of all predicates that are stably computable by some PM 
protocol that uses 0{f{n)) space in every agent (and in all of its tapes). 

Remark 1. All agents are identical and do not initially have unique ids, thus, stably computable predicates 
by the PM model on complete communication graphs have to be symmetric. 

In fact, although we provide general definitions, we are mainly interested in the class PLM = PMSPA- 
CE(\og n) and we call a PM protocol a PALOMA protocol (standing for PAssivcly mobile LOgarithmic space 
MAchines) if it always uses O(logn) space. Our main result in this paper is the following exact character- 
ization for PLM: PLM is equal to SN SPACE(n\ogn). In fact, the proof we give easily generalizes and 
also provides an exact characterization for PM SPACE(f{n)), when f(n) = J?(logn): PM SPACE(f(n)) 
is equal to SNSPACE(nf(n)). 

5 Two Examples & a First Inclusion for PLM 
5.1 Multiplication of Variables 

We present now a PM protocol that stably computes the predicate (N c — N a ■ Nb) using O(logn) space (on 
the complete communication graph of n nodes) that is, all agents eventually decide whether the number of 

3 For simplicity and w.l.o.g. we have made the assumption that the input to each agent is a single symbol but our 
definitions can with little effort be modified to allow the agents receive whole strings as input. 
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cs in the input assignment is the product of the number of as and the number of 6s. We give a high-level 
description of the protocol. 

Initially, all agents have one of a, b and c written on the first cell of their working memory (according to 
their sensed value). That is, the set of input symbols is X = S = {a, 6, c}. Each agent that receives input a 
goes to state a and becomes ready for interaction (sets its working flag to 0). Agents in state a and b both 
do nothing when interacting with agents in state a and agents in state b. An agent in c initially creates in its 
working memory three binary counters, the o-counter that counts the number of as, the 6-countcr, and the 
c-counter, initializes the a and b counters to 0, the c-countcr to 1, and becomes ready. When an agent in state 
a interacts with an agent in state c, a becomes a to indicate that the agent is now sleeping, and c does the 
following (in fact, we assume that c goes to a special state c a in which it knows that it has seen an a, and that 
all the following are done internally, after the interaction; finally the agent restores its state to c and becomes 
again ready for interaction): it increases its a-counter by one (in binary), multiplies its a and b counters, 
which can be done in binary in logarithmic space (binary multiplication is in LOG SPACE), compares the 
result with the c-counter, copies the result of the comparison to its output tape, that is, 1 if they are equal 
and otherwise, and finally it copies the comparison result and its three counters to the outgoing message 
tape and becomes ready for interaction. Similar things happen when a b meets a c (interchange the roles of 
a and b in the above discussion). When a c meets a c, the responder becomes c and copies to its output tape 
the output bit contained in the initiator's message. The initiator remains to c, adds the a-counter contained 
in the responder's message to its a-counter, the b and c counters of the message to its b and c counters, 
respectively, multiplies again the updated a and b counters, compares the result to its updated c counter, 
stores the comparison result to its output and outgoing message tapes, copies its counters to its outgoing 
message tape and becomes ready again. When a a, b or c meets a c they only copy to their output tape the 
output bit contained in c's message and become ready again (eg a remains a), while c does nothing. 

Note that the number of cs is at most n which means that the c-counter will become at most [log n\ bits 
long, and the same holds for the a and b counters, so O(logn) memory is required in each tape. 

Theorem 1. The above PM protocol stably computes the predicate (N c — N a ■ N b ) using O(logn) space. 

Proof. Given a fair execution, eventually only one agent in state c will remain, its a-counter will contain 
the total number of as, its 6-counter the total number of &s, and its c-counter the total number of cs. By 
executing the multiplication of the a and b counters and comparing the result to its c-counter it will correctly 
determine whether (N c = N a -Nb) holds and it will store the correct result (0 or 1) to its output and outgoing 
message tapes. At that point all other agents will be in one of the states a, b, and c. All these, again due 
to fairness, will eventually meet the unique agent in state c and copy its correct output bit (which they will 
find in the message they get from c) to their output tapes. Thus, eventually all agents will output the correct 
value of the predicate, having used O(logn) memory. □ 

Corollary 1. The class of semilinear predicates is a proper subset of PLM. 

Proof. PALOMA protocols simulate population protocols and (N c = N a • Nb) € PLM, which is non- 
semilinear. □ 



5.2 Power of 2 

Here, we present a PM protocol that, using O(logn) memory, stably computes the predicate (N\ = 2*), where 
t E Z>o, on the complete communication graph of n nodes, that is, all agents eventually decide whether the 
number of Is in the input assignment is a power of 2. 

The set of input symbols is S = {0, 1}. All agents that receive 1 create a binary 1-counter to their working 
tape and initialize it to 1. Moreover, they create a binary next-powjofl block and set it to 2. Finally, they 
write 1 (which is interpreted as "true" ) to their output tape, and copy the 1-countcr and the output bit to 
their outgoing message tape before going to state 1 and becoming ready. Agents that receive input write 
(which is interpreted as "false") to their output tape, go to state 0, and become ready. Agents in state 
do nothing when interacting with each other. When an agent in state interacts with an agent in state 1, 
then simply copies the output bit of 1. When two agents in state 1 interact, then the following happen: 
the initiator sets its 1-counter to the sum of the responder's 1-counter and its own 1-counter and compares 
its updated value to nextjpow_of2. If it is less than nextjpow_of2 then it writes to the output tape. If it is 
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equal to nextjpow-of2 it sets nextjpow-of2 to 2 • next_pow-of2 and sets its output bit (in the output tape) to 
1. If it is greater than next-pou>-of2, then it starts doubling next_pow-of2 until 1-counter > nextjpow-of2 is 
satisfied. If it is satisfied by equality, then it doubles nextjpow-of2 one more time and writes 1 to the output 
tape. Otherwise, it simply writes to the output tape. Another implementation would be to additionally 
send next-pow_of2 blocks via messages and make the initiator set nextjpowjof2 to the maximum of its own 
and the responder's nextjpow-of2 blocks. In this case at most one doubling would be required. Finally, in 
both implementations, the initiator copies the output bit and the 1-counter to its outgoing message tape (in 
the second implementation it would also copy nextjpow-of2 to the outgoing message tape), remains in state 
1, and becomes ready. The responder simply goes to state I and becomes ready. An agent in state 1 does 
nothing when interacting with an agent in state and vice versa. When an agent in state 1 interacts with 
an agent in state 1, then 1 simply copies the output bit of 1. 

Note that next_pow-oJ2 can become at most 2 times the number of Is in the input assignment, and 
the latter is at most n. Thus, it requires at most |~log2n] bits of memory. Either way, we can delay the 
multiplication until another 1 appears, in which case we need at most [logn] bits of memory for storing 
nextjpow-of2 (the last unnecessary multiplication will never be done). 

6 Assigning Unique IDs by Reinitiating Computation 

In this section, we first prove that PM protocols can assume the existence of unique consecutive ids and 
knowledge of the population size at the space cost of O(logn) (Theorem 2). In particular, we present a 
PM protocol that correctly assigns unique consecutive ids to the agents and informs them of the correct 
population size using only O(logn) memory, without assuming any initial knowledge of none of them. We 
show that this protocol can simulate any PM protocol that assumes the existence of these ids and knows the 
population size. At the end of the section, we exploit this result to prove that SSPACE(nlogn) C PLM. 

Pick any p € SI PLM. Let A be the IPM protocol that stably computes it in O(logn) space. We now 
present a PM protocol X, containing protocol A as a subroutine (see Protocol 1), that stably computes p, by 
also using O(logn) space. X is always executed and its job is to assign unique ids to the agents, to inform 
them of the correct population size and to control *4's execution (e.g. restarts its execution if needed). A, 
when X allows its execution, simply reads the unique ids and the population size provided by X and executes 
itself normally. We first present X and then prove that it eventually correctly assigns unique ids and correctly 
informs the agents of the population size, and that when this process comes to a successful end, it restarts A's 
execution in all agents without allowing non-reinitialized agents to communicate with the reinitialized ones. 
Thus, at some point, A will begin its execution reading the correct unique ids and the correct population 
size (provided by X), thus, it will get correctly executed and will stably compute p. 

We begin by describing I's variables, id is the variable storing the id of the agent (from which A reads the 
agents' ids), sid the variable storing the id that an agent writes in its outgoing message tape in order to send 
it, and rid the variable storing the id that an agent receives via interaction. Recall the model's convention 
that all variables used for sending information, like sid, preserve their value in future interactions unless 
altered by the agent. Initially, id = sid = for all agents. All agents have an input backup variable binput 
which they initially set to their input symbol and make it read-only. Thus, each agent has always available 
its input via binput even if the computation has proceeded, working represents the block of the working 
tape that A uses for its computation and output represents the contents of the output tape, initiator is a 
binary flag that after every interaction becomes true if the agent was the initiator of the interaction and 
false otherwise (this is easily implemented by exploiting the external transition function), ps is the variable 
storing the population size, sps the one used to put it in a outgoing message, and rps the received one. 
Initially, ps = sps = 1. 

We now describe X's functionality. Whenever a pair of agents with the same id interact, the initiator 
increases its id by one and both update their population size value to the greater id plus one. Whenever two 
agents with different ids and population size values interact, they update their population size variables to 
the greater size. Thus the correct size (greatest id plus one) is propagated to all agents. Both interactions 
described above reinitialize the participating agents (restore their input and erase all data produced by the 
subroutine A, without altering their ids and population sizes) . A, runs as a subroutine whenever two agents 
of different ids and same population sizes interact, using those data provided by X. 
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Protocol 1 X 



1: if rid == id then / / two agents with the same ids interact 
2: if initiator == 1 then / / the initiator 

3: id 4— id + 1, sid 4— id / / increases its id by one and writes it in the outgoing message tape 

4: ps 4— id + 1, sps 4- ps 1 1 sets the population size equal to its updated id plus 1 

5: else // the responder 

6: ps 4- id + 2, sps <— ps 

7: end if 

8: // both clear their working block and copy their input symbol into it 

9: // they also clear their output tape 

10: working <— binput, output 

11: else // two agents whose ids differ interact 

12: if rps > ps then / / the one who knows an outdated population size 

13: working <— binput, output <— // is reinitialized 

14: ps <— rps, sps 4- ps / / and updates its population size to the greater value 

15: else if rps == ps then / / they know the same population size 

16: // so they are both reinitialized and can proceed executing A 

17: execute A for 1 step 

18: end if 

19: end if 



Lemma 1. (i) No agent id will get greater than n — 1, and no ps variable will get greater than n. (ii) I 
assigns the ids {0,1, ... ,n — 1} in a finite number of interactions, (in) X sets the ps variable of each agent 
to the correct population size in a finite number of interactions. 

Proof, (i) By an easy induction, in order for an id to reach the value v, there have to be at least v + 1 agents 
present in the population. Thus, whenever an id gets greater than n — 1, there have to be more than n agents 
present, which creates a contradiction. Similar arguments hold for the ps variables 

(ii) Assume on the contrary that it does not. Because of (i), at each point of the computation there will exist 
at least two agents, u, v such that id u = id v . Due to fareness, an interaction between such agents shall take 
place infinitely many times, creating an arbitrarily large id which contradicts (i). 

(hi) The correctness of the id assignment ((i),(ii)) guarantees that after a finite number of steps two agents, 
u, v will set their ps variables to the correct population size (upon interaction in which id u = id v = n — 2). 
It follows from (i) that no agent will have its ps variable greater than n. Fareness guarantees that each other 
agent will interact with u or v, updating its ps to n. □ 

Lemma 2. Given thatX's execution is fair, A's execution is fair as well. 

Proof. Due to the fact that the id-assignment process and the population size propagation are completed in 
a finite number of steps, it suffices to study fairness of *4's execution after their completion. The state of each 
agent may be thought of as containing an Z-subcomponent and an ^-subcomponent, with obvious contents. 
Denote by Ca the unique subconfiguration of C consisting only of the ^-subcomponents of all agents and 
note that some Ca may correspond to many superconfigurations C. Assume that Ca — > C' A and that Ca 
appears infinitely often (since here we consider A's configurations, this '— >' refers to a step of ,4's execution). 
Ca ~ > C' A implies that there exist superconfigurations C , C of Ca, C' a , respectively, such that C — > C (via 
some step of A in the case that Ca C' a ). Due to X's fairness, if C appears infinitely often, then so does C 
and so does C' A since it is a subconfiguration of C. Thus, it remains to show that C appears infinitely often. 
Since Ca appears infinitely often, then the same must hold for all of its superconfigurations. The reasoning 
is as follows. All those superconfigurations differ only in the I-subcomponents, that is, they only differ in 
some variable checks performed by X (after the id-assignment process and the population size propagation 
have come to an end, nothing else is performed by X). But all of them are reachable from and can reach a 
common superconfiguration of Ca in which no variable checking is performed by X, thus, they only depend 
on which pair of agents is selected for interaction and they are all reachable from one another. Since at least 
one of them appears infinitely often then, due to the fairness of X's execution, all of them must also appear 
infinitely often and this completes the proof. □ 
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By combining the above lemmata we can prove the following: 
Theorem 2. PLM = SIPLM. 

Proof. PLM C SIPLM holds trivially, so it suffices to show that SIPLM C PLM. We have presented a 
PLM protocol (protocol 1) that assigns the agents unique consecutive ids after a finite number of interactions 
and informs them of the population size (lemma 1). It follows directly from the protocol that after that point, 
further fair execution of X will result in execution of protocol A which can take into account the existance 
of unique ids. Moreover, execution of A is guaranteed to be fair (lemma 2). □ 

7 Exploring PLM 

7.1 An upper bound 

We first prove that PLM C NSPACE(n\ogn). 

Theorem 3. All predicates in PLM are in the class NSPACE(n\ogn). 

Proof. Let A be a PM protocol that stably computes such a predicate p using O(log n) memory. A population 
configuration can be represented as an n— place vector storing an agent configuration per place, and thus 
uses O(nlogn) space in total. The language L p derived from p is the set of all strings that make p true, that 
is, L p = {x e X* | p(x) = 1}. 

We will now present a nondeterministic Turing Machine Ma that decides L p in O(nlogn) space. To 
accept the input (assignment) x, Ma must verify two conditions: That there exists a configuration C 
reachable from the initial configuration corresponding to x in which the output tape of each agent indicates 
that p holds, and that there is no configuration C reachable from C under which p is violated for some 
agent. 

The first condition is verified by guessing and checking a sequence of configurations. Starting from the 
initial configuration, each time Ma guesses configuration d+i and verifies that d yields Cj+i. This can 
be caused either by an agent transition u, or an encounter (u,v). In the first case, the verification can be 
carried out as follows: Ma guesses an agent u so that d and d+i differ in the configuration of u, and that 
Ci(u) yields Cj+i(u). It then verifies that Cj and d+i differ in no other agent configurations. Similarly, in 
the second case Ma nondeterministically chooses agents u, v and verifies that encounter (u,v) leads to C 
by ensuring that: (a) both agents have their working flags cleared in C, (b) the tape exchange takes place in 
C, (c) both agents update their states according to 7 and set their working flags to 1 in C and (d) that d 
and Cj+i differ in no other agent configurations. In each case, the space needed is O(nlogn) for storing d, 
d+i, plus O(logn) extra capacity for ensuring the validity of each agent configuration in d+i- 

If the above hold, Ma replaces d with d+i and repeats this step. Otherwise, Ma drops d+i- Any 
time a configuration C is reached in which p holds, Ma computes the complement of a similar reachability 
problem: it verifies that there exists no configuration reachable from C in which p is violated. Since N SPACE 
is closed under complement for all space functions > logn (see Immcrman-Szclcpcsenyi theorem, [19], pages 
151 — 153), this condition can also be verified in O(nlogn) space. Thus, L p can be decided in O(nlogn) 
space by some nondeterministic Turing Machine, so L p <E NSPACE(n\ogn). □ 

7.2 A lower bound 

We now prove that SNSPACE(n\ogn) is a subset of PLM. We establish this result by first proving a 
weaker one, that is, that SSPACE(n\ogn) is a subset of PLM. 

Theorem 4. SSPACE(n log n) C PLM. 

Proof. Let p : X* — > {0,1} be any predicate in SSPACE(n\ogn) and M be the deterministic TM that 
decides p by using O(nlogn) space. We construct a PM protocol A that stably computes p by exploiting its 
knowledge of the population size. Let x be any input assignment in X* . Each agent receives its input symbol 
according to x (e.g. u receives symbol x(u)). Now the agents obtain unique ids according to protocol 1. The 
agent that has obtained the unique id starts simulating M. 
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Assume that currently the simulation is carried out by an agent u having the id i u . Agent u uses its 
simulation tape to write symbols according to the transition function of AA . Any time the head of AA moves 
to the left, u moves the head of the simulation tape to the left, pauses the simulation, writes the current 
state of AA to its outgoing message tape, and passes the simulation to the agent v having id i v = (i u — 1) 
mod n. Similarly, any time the head of AA moves to the right, u moves the head of the simulation tape to 
the right, pauses the simulation, writes the current state of AA to its outgoing message tape, and passes the 
simulation to the agent v having id i v = (i u + 1) mod n. In both cases, agent v copies the state of AA to its 
working tape, and starts the simulation. 

Whenever, during the simulation, AA accepts, then A also accepts; that is, the agent that detects AA's 
acceptance, writes 1 to its output tape and informs all agents to accept. If AA rejects, it also rejects. Finally, 
note that A simulates AA not necessarily on input x — (s , s 1; . . . , s n ^i) but on some x' which is a permutation 
of x. The reason is that agent with id i does not necessarily obtain Si as its input. The crucial remark that 
completes the proof is that AA accepts x if and only if it accepts x' , because p is symmetric. 

Because of the above process, it is easy to verify that the k— th cell of the simulation tape of any agent u 
having the id i u corresponds to the n ■ k + i u — th cell of AA. Thus, whenever AA alters I — 0(n ■ logn) tape 
cells, any agent u will alter V = = O(logn) cells of its simulation tape. □ 

We now show how the above approach can be generalized to include nondeterministic Turing Machines. 
Theorem 5. SNSPACE(nlogn) C PLM. 

Proof. By considering Theorem 2, it suffices to show that SNSPACE(n\ogn) is a subset of SIPLM. 
We have already shown that the IPM model can simulate a deterministic TM A4 of O(nlogn) space by 
using O(logn) space (Theorem 4). We now present some modifications that will allow us to simulate a 
nondeterministic TM Af of the same memory size. Keep in mind that Af is a decider for some predicate in 
SNSPACE(n\ogn), thus, it always halts. Upon initialization, each agent enters a reject state (writes to 
its output tape) and the simulation is carried out as in the case of A4 . 

Whenever a nondeterministic choice has to be made, the corresponding agent gets ready and waits for 
participating in an interaction. The id of the other participant will provide the nondeterministic choice to 
be made. One possible implementation of this idea is the following. Since there is a fixed upper bound on 
the number of nondeterministic choices (independent of the population size), the agents can store them in 
their memories. Any time a nondeterministic choice has to be made between k candidates the agent assigns 
the numbers 0, 1, . . . , fc — lto those candidates and becomes ready for interaction. Assume that the next 
interaction is with an agent whose id is i. Then the nondeterministic choice selected by the agent is the one 
that has been assigned the number i mod k. Fairness guarantees that, in this manner, all possible paths in 
the tree representing Af's nondeterministic computation will eventually be followed. 

Any time the simulation reaches an accept state, all agents change their output to 1 and the simulation 
halts. Moreover, any time the simulation reaches a reject state, it is being reinitiated. The correctness of the 
above procedure is captured by the following two cases. 

1. If Af rejects then every agent's output stabilizes to 0. Upon initialization, each agent's output is and 
can only change if Af reaches an accept state. But all branches of Af's computation reject, thus, no accept 
state is ever reached, and every agent's output forever remains to 0. 

2. IfAf accepts then every agent 's output stabilizes to 1. Since Af accepts, there is a sequence of configurations 
S, starting from the initial configuration C that leads to a configuration C in which each agent's output 
is set to 1 (by simulating directly the branch of Af that accepts). Notice that when an agent sets its 
output to 1 it never alters its output tape again, so it suffices to show that the simulation will eventually 
reach C. Assume on the contrary that it does not. Since Af always halts the simulation will be at the 
initial configuration C infinitely many times. Due to fairness, by an easy induction on the configurations 
of S, C will also appear infinitely many times, which leads to a contradiction. Thus the simulation will 
eventually reach C and the output will stabilize to 1. 

□ 

Taking into consideration the above theorems we can conclude to the following exact characterization for 
the class PLM: 



Theorem 6. PLM is equal to SNSPACE(n\ogn). 
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Proof. Follows from Theorems 4, which establishes that SN SPACE '(nlogn) C PLM, and 3, which es- 
tablishes that PLM C N SPACE (nlogn); but for all p e PLM, p is symmetric (Remark 1), thus, 
PLM C SNSPACE(n log n) . □ 



7.3 Simulating Nondeterministic Recognizers 

Here, we generalize the preceding ideas to nondeterministic recognizers of 0(n log n) space. There is a way to 
stably compute predicates in SSP ACE(n log n) even when the corresponding TM N might loop, by carrying 
out an approach similar to the one given above. However, since neither an accept nor a reject state may be 
reached, the simulation is nondeterministically reinitiated at any point that is not in such a state. This choice 
is also obtained by the nondeterministic interactions. For example, whenever the agent that carries out the 
simulation interacts with an agent that has an id that is even, the simulation remains unchanged, otherwise 
it is reinitiated. Notice however that during the simulation, any agent having id i may need to interact with 
those having neighboring ids, so those must not be able to cause a reinitiation in the simulation. 

Correctness of the above procedure is captured by similar arguments to those in the proof of Theorem 
5. If AT never accepts, then no output tape will ever contain a 1, so the simulation stabilizes to 0. If Af 
accepts there is a sequence of configurations S, starting from the initial configuration C that leads to a 
configuration C in which each agent's output is set to 1. Observe that this is a "good" sequence, meaning 
that no reinitiations take place, and, due to fairness, it will eventually occur. 



8 Space Hierarchy of the PM Model 

In this section we study the behaviour of PM model for various space bounds. 

Theorem 7. For any function f : N N, any predicate in PMSPACE(f(n)) is also in SNSPACE(2^ ( 
f(n) +logn)). 

Proof. Take any p e PMSPACE(f(n)). Let A be the PM protocol that stably computes predicate p 
in space 0{f(n)). L p = {(si, s 2 , ■ . ■ , s n ) s, £ 1 for alii G {1, . . . , n} and p{s\, s 2 , . . . , s n ) = 1} is the 
language corresponding to p (X C S* is the set of input strings) . We describe a NTM Af that decides L p in 
g(n) = 0(2^")(/(n)+Iogn)) space. 

Note that each agent uses memory of size 0(f(n)). So, by assuming a binary tape alphabet r = {0, 1} 
(the alphabet of the agents' tapes), an assumption which is w.l.o.g., there are 2°(fW) different agent con- 
figurations (internal configurations) each of size 0(f(n)). Af stores a population configuration by storing all 
these agent configurations, consuming for this purpose 0(f(n)2^ n ') space, together with a number per agent 
configuration representing the number of agents in that agent configuration under the current population 
configuration. These numbers sum up to n and each one of them requires O(logn) bits, thus, 0(2^") logn) 
extra space is needed, giving a total of 0(2^ n '(f(n) + logn)) space needed to store a population configura- 
tion. The reason that such a representation of population configurations suffices is that when k agents are in 
the same internal configuration there is no reason to store it k times. The completeness of the communication 
graph allows us to store it once and simply indicate the number of agents that are in this common internal 
configuration, that is, k. 

Now Af does what the deterministic TM of the Theorem 3 does. The main difference is that it now store 
the population configurations according to the new representation we discussed above. □ 

Theorem 8 (Symmetric Space Hierarchy Theorem). For any function f : N — > N, a symmetric 
language L exists that is decidable in 0(f(n)) (non) deterministic space but not in o(/(n)) (non) deterministic 
space. 

Proof. Follows immediately from the unary (tally) separation language presented in [17] and the fact that 
any unary language is symmetric. □ 



Corollary 2. For any function f(n) = o(logn) it holds that PMSPACE(f(n)) C SNSPACE(nf(n)). 
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Proof. By considering Theorem 7, the symmetric space hierarchy theorem (Theorem 8) it suffices to show 
that 2^ n \f{n) + logn) = o(n/(n)) for f(n) — o(logn). We have that 

2 /( " ) (/(n) + logn) = 2°( los ")0(logn) = o(n)0(logn), 

which obviously grows slower than nf(n) — n ■ o(logn). □ 

So, for example, if f(n) = log logn, then PMSPACE(\og log n) C SNSPACE(log 2 n) which is strictly 
smaller than SNSPACE(nf(n)) — SNSPACE(n\og\ogn) by the symmetric space hierarchy theorem. 
Another interesting example is obtained by setting f(n) — c. In this case, we obtain the S N S P AC Eilogn) = 
SNL (for Symmetric NL) upper bound for population protocols of Angluin et al. [2] (in [3] they obtained 
a better bound which is the semilinear predicates and constitutes an exact characterization for population 
protocols). 

Although the above upper bounds are relatively tight for /(n) < logn space functions, the bounds for 
f(n) > logn have worse behavior in relation to the nf(n). In the theorem below, tighter upper bounds 
are shown by working with a different representation of population configurations that holds each agent's 
internal configuration explicitly. 

Theorem 9. For any function f(n) it holds that PM SPACE(f(n)) is a subset of SNSPACE(nf(n)). 

Proof. This proof is similar to the proof establishing that PLM is a subset of SN — SPACE(nlogn) in 
Theorem 3. It can be easily shown, using similar arguments to those of the previously stated proof, that all 
predicates in PMSPACE(f(n)) are also in S N S P AC E{nf {n)) . In particular, there is a nondeterministic 
TM M of space 0(nf(n)) that can decide a language corresponding to any predicate p G PMSPACE(f(n)). 
A4 holds the internal configurations of the n agents each of which needs 0(/(n)) space and therefore each 
configuration (of the entire population) requires 0{nf{n)) space in Ad's tape. M starts with the initial 
configuration C, guesses the next C and checks whether it has reached a configuration in which p holds. 
When M. reaches such a configuration C it computes the complement of a similar reachability problem: 
it verifies that there exists no configuration reachable from C in which p is violated. This condition can 
also be verified in 0(nf(n)) space since N SPACE is closed under complement for all space functions 
g(n) = J?(logn) (Immerman-Szelepcsenyi theorem [19], pages 151-153). Note that for any reasonable function 
f(n), g(n) = nf(n) > logn, as required by the Immerman-Szelepcsenyi theorem. □ 

The upper bounds shown above are obviously better for functions /(n) = J?(logn) than those presented 
by Theorem 7. Note however, that Theorem 9 also holds for f(n) = o(logn) and for those space constructiblc 
functions the upper bounds are worse than those of Theorem 7. In order to realize that consider the function 
f(n) = c (the memory of each agent is independent of the population size, thus this is the basic PP model). 
According to Theorem 9 the upper bound is the trivial SNSPACE(n), whereas the Theorem 7 decreases the 
upper bound to S N S PAC E(log n). This behavior is expected due to the configuration representation of the 
population used by those theorems. When the configuration is stored as n-vector where each element of the 
vector holds the internal configuration of an agent (representation used in Theorem 9) then as the memory 
size grows the additional space needed is a factor n of that growth. On the other hand, when a configuration is 
represented as a vector of size equal to the number of all possible internal configurations where each element 
is the number of agents that are in the corresponding internal configuration (as in Theorem 7) then the size 
of the vector grows exponentially to the memory growth. Therefore tighter upper bounds are obtained by 
Theorem 9 for functions f(n) = J2(logn) and by Theorem 7 for f(n) = o(logn). Note that for f(n) = logn 
the bounds by both theorems are the same. 

The next theorem shows that for space functions f(n) = J7(logn), the PM model can simulate a nonde- 
terministic TM of space 0(nf(n)) using 0(f(n)) in each agent. The intuition here is that with at least logn 
memory in each agent we can always assign unique ids and propagate the size of the population, as shown 
in Section 6, and therefore we can always organize the population into a nondeterministic TM where each 
of the n agents provides C(/(n)) space for the simulation. 

Theorem 10. For any f(n) = f2(logn) it holds that SNSPACE(nf(n)) is a subset PMSPACE(f(n)). 

Proof. In Theorem 5 a nondeterministic TM of O(nlogn) space is simulated by a PALOMA protocol pro- 
vided that all agents know the population size and have unique ids. In addition in Section 6 a construction 
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is provided that allows the PALOMA protocols to assume ids and population size knowledge. The same 
construction can be used for the PM protocols which use C(/(n)) space for f(n) = J7(logn). The reason 
is that in this space every agent can store a unique id as well as the population size. Therefore the same 
protocol X of Theorem 2 can be used here. □ 

From the previous two theorems we get a generalization of the exact characterization for all PMSPACE( 
/(n)), when f{n) = J?(logn). It is formally stated as: 

Theorem 11. For any f{n) = tf(logn) it holds that PMSPACE{f(n)) = SNSPACE(nf(n)). 

Proof. It holds from theorems 9 and 10. □ 

Thus, by considering together Corollary 2 and Theorem 11 we obtain the following corollary. 

Corollary 3. f(n) = 0(\ogn) acts as a threshold for the computational power of the PM model. 

Considering the above analysis and Corollary 3 it is worth noting that the PALOMA protocols, seem to 
belong to a golden section between realistic requirements, w.r.t. implementation, and computational power 
offered. 

Theorem 12 (Space Hierarchy Theorem of the PM model). For any two functions f,g : N — > 
N, where f(n) = Qilogn) and g(n) = o(/(n)) there is a predicate p in PMSPACE(f(n)) but not in 
PMSPACE(g(n)). 

Proof. From Theorem 8 we have that for any such functions /, g there is a language L £ SNSPACE(nf(n)) 
so that L SNSPACE(ng(n)). From Theorem 11 we have that PMSPACE(f(n)) = SNSPACE(nf(n)), 
therefore pl G PMSPACE(f{n)) (where pl is the symmetric predicate that corresponds to the sym- 
metric language L). We distinguish two cases. If g(n) = J7(logn) then from Theorem 11 we have that 
PMSPACE(g(n)) = SNSPACE(ng(n)) and so L £ PMSPACE(g(n)) or equivalently p L £ PMSPACE( 
g{n)). If g(n) = o(logn) then from Corollary 2 we have that PMSPACE(g(n)) C SNSPACE(ng(n)) 
C SNSPACE(nf(n)) = PMSPACE(f(n)). □ 

9 A threshold in the Computability of the PM Model 

In this section we explore the computability of the PM model when the protocols use o(log log n) space. We 
show that this bound acts as a threshold constraining the corresponding protocols to compute only semilinear 
predicates. 

Let SEMILINEAR denote the class of the semilinear predicates. We will prove that SEMILINEAR = 
PMSPACE(f(n)) when f(n) — o(loglogn). Moreover, we will prove that this bound is tight, so that 
SEMILINEAR C PMSPACE(f(n)) when f(n) = O(loglogn). 

9.1 o(loglogn) Threshold 

Definition 5. Let A be a PM protocol executed in a population V of size n. Define agent configuration 
graph, Ra,v = {U, W, F}, a graph such that: 

— U is the set of the agent configurations that can occur in any execution of A such that the working flag 
is set to 0. 

— W is the set of edges (u,v), u,v G U so that an agent configuration v can occur from a configuration u 
via a single interaction. 

— F : W — > U x {i,r} so that when an agent k being in configuration u enters configuration v via a single 
interaction with an agent being in configuration w, and k acts as x e {i,r} (initiator-responder) in the 
interaction, then F((u,v)) = {w,x}, while F((u,v)) = in any other case. F is a function since we only 
deal with deterministic protocols. 



14 



Ioannis Chatzigiannakis, Othon Michail, Stavros Nikolaou, Andreas Pavlogiannis, and Paul G. Spirakis 



In other words, U contains the configurations that an agent may enter when we don't take into consid- 
eration the ones that correspond to internal computation, while W defines the transitions between those 
configuration through interactions defined by F. Notice that in general, Ra,v depends not only on the 
protocol A, but on the population V. We call a u £ U initial node iff it corresponds to an initial agent 
configuration. 

Because of the uniformity property, we can deduce the following theorem: 

Theorem 13. Let Ra,v, Ra.v be two agent configuration graphs corresponding in a protocol A for two 
different populations V , V of size n and n' respectively, with n < n' . Then, there exists a subgraph R* of 
Ra,v such that R* = Ra,v> which contains all the initial nodes of Ra.v ' 

Proof. Indeed, let V{, V 2 ' be a partitioning of V such that V{ = V, and observe the agent configuration 
graph that yielded by the execution of A in V{. Since both populations execute the same protocol A, and 
V[ = V, Ra,v = Ra,v{- Moreover, since the initial nodes are the same for both populations, they must be 
in Ra.v[ - Finally, Ra,v{ is a subgraph of Ra,v, as V{ C V , and the proof is complete. □ 

The above theorem states that while we explore populations of greater size, the corresponding agent 
configuration graphs are only enhnanced with new nodes and edges, while the old ones are preserved. 
Given an agent configuration graph, we associate each node a with a value r(a) inductively, as follows: 

Base Case For any initial node a, r(a) — ri n u = 1. 

Inductive Step For any other node a, r(a) = min(r(b) + r(c)) such that a is reachable from b through an 
edge labeled as c, and b, c have already been assigned an r value. 

Lemma 3. Let Ra,v — {U, W, F} be an agent configuration graph. Every node in Ra,v get associated with 
an r value. 

Proof. Assume for the shake of the contradiction that there is a maximum, non empty set of nodes U' C U 
such that Vt; € U', v does not get associated with an r value. Then B = U — U' , and C — (B,W) defines 
a cut, with all the initial nodes being in B. We examine any edge (u,v) labeled as (w, x) that crosses the 
cut. Obviously u G B and v G U' , and u is associated with a value r(u). Since v is not associated with any 
r value, the same must hold for node w (otherwise r(v) = r(u) + r(w)). We now examine the first agent c 
that enters a configuration corresponding to some v G U' . Because of the above observation, this could only 
happen through an interaction with an agent being in a configuration that is also in U' which creates the 
contradiction. □ 

Note that for any given protocol and population size, the r values are unique since the agent configuration 
graph is unique. The following lemma captures a bound in the r values when the corresponding protocol 
uses f(n) = o(loglogn) space. 

Lemma 4. Let r max ^i be the i—th greatest r value associated with any node in an agent configuration graph. 
For any protocol A that uses f(n) = o(loglogn), there exists a n such that for any population of size n > n 
r < 2 

1 max \ 2 * 

Proof. Since f(n) = o(loglogn), lim n ^oo x J^ gn = 0, so lim^oo lo ^'°^" = oo and lim^oo |jf^- = oo. It 

follows from the last equation that for any positive integer M, there exists a fixed no such that ^f^y > M 
for any n > n . 

Fix any such n and let k — \U\ < 2^™-' in the corresponding agent configuration graph. Since any node is 
associated with an r value, there can be at most k different such values. Now observe that r max < 2-r max -i < 
■ ■ ■ < 2 k ■ r mlt < 2 2f (") < 2t < V™ < f for n > max(n a , 2) and M > 2. □ 

The following lemma proves that the r values correspond to some reachability properties in the agent 
configuration graph. 

Lemma 5. Let Q(a) be the following property: Given a node a in an agent configuration graph, there exists 
a population of size r(a) and a fair execution of the corresponding protocol that will lead an agent to the 
configuration a. Q(a) holds for any node of the agent configuration graph. 



Passively Mobile Communicating Machines that Use Restricted Space 



15 



Proof. We prove the above lemma by generalized induction in the r values. 
Base Case: Q(a) obviously holds for any initial node u, since ri n a = 1. 

Inductive Step: We examine any non initial node u that has been associated with a value r(u) = r(a)+r(b), 
for some a, b. The inductive hypothesis guarantees that Q(a) and Q(b) hold. Then, a population of size 
r(a) + r(b) can lead two agents to configurations a and b independently. Then an interaction between 
those agents will take one of them to configuration u, so Q(u) holds too. 

□ 

Lemmata 4 and 5 lead to the following: 

Lemma 6. For any protocol A that uses f(n) — o(loglogn) there exists a fixed hq such that for any popu- 
lation of size n > n and any pair of agent configurations u,v, the interaction (u,v) can occur. 

Proof. Indeed, because of the lemma 4, there exists a hq such that for any n > no, r(a) < ^ for any a. With 
that in mind, lemma 5 guarantees that in any such population any interaction (u, v) since any of the agent 
configurations u, v can occur independently, by partitioning the population in two subpopulations of size ^ 
each. □ 

We can complete our proof with the following theorem: 

Theorem 14. Any PM protocol A that uses f(n) — o(loglogn) can only compute semilinear predicates. 

Proof. Because of the uniformity constraint, A can be executed in any population of arbitrary size. We 
choose a fixed n as defined in lemma 4 and examine any population L of size n > no. Let Ra,l be the 
corresponding agent configuration graph. Let V be any population of size n' > n and Ra,l> the corresponding 
agent configuration graph. Because of the theorem 13, Ra,l> contains a subgraph K , such that K — Ra.l, 
and the initial nodes of Ra,l> are in K. Let U* = U' — U, and k the first agent configuration that appears 
in L' such that k e U* through an interaction (u,v)(k can't be an initial configuration, thus it occurs 
through some interaction). Then u,v G U, and the interaction (u,v) can occur in the population L too 
(lemma 6), so that k e U, which refutes our choice of k creating a contradiction. So, U* = 0, and the set 
of agent configurations does not change as we examine populations of greater size. Since the set of agent 
configurations remains unchanged, the corresponding predicate can be computed by the Population Protocol 
model, thus it is semilinear. □ 

In practice, the above theorem guarantees that for any protocol that uses only f(n) = o(loglogn) space 
in each agent, there exists a population of size no in which it stops using extra space. Since no is fixed, we 
can construct a protocol based on the agent configuration graph which uses constant space 4 , and thus can 
be executed in the Population Protocols Model. 

So far, we have stablished that when f(n) = o(loglogn), PMSPACE{f{n)) C SEMILINEAR. Since 
the inverse direction holds trivially, we can conclude that PM SPACE(f(n)) = SEMILINEAR. 



9.2 The Logarithmic Predicate 

We will now present the non semilinear logarithmic predicate, and devise a PM protocol that computes it 
using O (log log n) space in each agent. 

We define the logarithmic predicate as follows: During the initialization, each agent receives an input 
symbol from X = {a, 0}, and let N a denote the number of agents that have received the symbol a. We 
want to compute whether log N a = t for some arbitrary t. We give a high level protocol that computes this 
predicate, and prove that it can be correctly executed using O (log log n) space. 

Each agent u maintains a variable x Ul and let out u be the variable that uses to write its output. Initially, 
any agent u that receives a as his input symbol sets x u = 1 and out u — 1, while any other agent v sets 
x v — and out v = 1. 

Our main protocol consists of two subprotocols, A and B, that are executed concurrently. Protocol A 
does the following: whenever an interaction occurs between two agents, u, v, with u being the initiator, if 



4 Notice that the agent configuration graph can be viewed as a deterministic finite automaton. 
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x u — x v > 0, then x u = x u + 1 and x v = 0. Otherwise, nothing happens. Protocol B runs in parallel, and 
computes the semilinear predicate of determining whether there exist two or more agents having x > 0. If 
so, it outputs 0, otherwise it outputs 1. Observe that B is executed on stasbilizing inputs, as the x— variables 
fluctate before they stabilize to their final value. However, it is well known that the semilinear predicates are 
also computable under this constraint ([3]). 

Lemma 7. The main protocol uses O(loglogn) space. 

Proof. As protocol B computes a semilinear predicate, it only uses 0(c) space, with c being a constant. 
To examine the space bounds of A, pick any agent u. We examine the greatest value that can be assigned 
to the variable x u . Observe that in order for x u to reach value k, there have to be at least 2 pre-existsing 
x— variables with values k — 1. Through an easy induction, it follows that there have to be at least 2 k 
pre-existing variables with the value 1. Since 2 k < N a , k < \ogN a < logn, so x u is upper bounded by logn, 
thus consuming O (log logn) space. □ 

Lemma 8. For every agent u, eventually out u — 1 if \ogN a — t for some arbitrary t, and out u = 
otherwise. 

Proof. Indeed, the execution of protocol B guarantees that all agents will set out = 1 iff eventually there 
exists only one agent u that has a non-zero value assigned in x u . Assume that x u — k for some k. Then, 
because of the analysis of lemma 8 during the initialization of the population will exist 2 k x— variables set 
to 1. Since each of those variables corresponds to one a assignement, N a — 2 k =>■ \ogN a = k. On the other 
hand, if the answer of the protocol is then it means that there are t > 1 agents in the population with 
x— variables set to different values x\,X2,- •• ,xt otherwise they could have effective interactions with each 
other. Therefore, there should have initially existed 2 Xl ~ 1 + 2 X2 ~ 1 + • • • + 2 Xt ~ 1 agents with input a. This, 
however, means that N a ^ 2 k for any k since each number can be uniquely expressed as a sum of powers of 
2. Thus the protocol correctly outputs 0. □ 

Thus, we have presented a non-semilinear predicate that can be computed by a PM protocol using 
O(loglogn) space. Combining this result with the one presente in the previous subsection, we obtain the 
following theorem: 

Theorem 15 {Threshold Theorem). SEMILINEAR = PMSPACE{,f(n)) when f(n) = o(loglogn). 
Moreover, SEMILINEAR C PMSPACE(f(n)) when f(n) = O(loglogn). 

10 Conclusions - Future Research Directions 

We proposed the PM model, an extension of the PP model [2], in which the agents are communicating TMs. 
We mainly focused on studying the computational power of the new model when the space used by each agent 
is bounded by a logarithm of the population size. Although the model preserves uniformity and anonymity, 
interestingly, we have been able to prove that the agents can organize themselves into a nondeterministic TM 
that makes full use of the agents' total memory (i.e. of O(nlogn) space). The agents are initially identical 
and have no global knowledge of the system, but by executing an iterative reinitiation process they are able to 
assign unique consecutive ids to themselves and get informed of the population size. In this manner, we showed 
that PLM, which is the class of predicates stably computable by the PM model using (9 (logn) memory, 
contains all symmetric predicates in NSPACE(n\ogn). Moreover, by proving that PLM is a subset of 
N SPACE (nlogn), we concluded that it is precisely equal to the class consisting of all symmetric predicates 
in NSPACE(n\ogn). We then generalized this to SNSPACE(nf(n)) when the agents use C(/(n)) memory, 
for all space functions /(n) = J?(logn). We also explored the behavior of the PM model for space bounds 
f(n) = o(logn) and proved that SEMILINEAR = PMSPACE(f(n)) when f(n) = o(loglogn). Finally, 
we showed that this bound is tight, that is, SEMILINEAR C PMSPACE{f{n)) when /(n) = O(loglogn). 

Many interesting questions remain open. Is the PM model fault-tolerant? What preconditions are needed 
in order to achieve satisfactory fault-tolerance? What is the behavior of the model when the agents use 
0(f(n)) memory, where /(n) = o(logn) and /(n) = J?(loglogn). 
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